Tag
9 articles
Anthropic has released Claude Opus 4.8, an upgraded AI model that enhances performance in coding, reasoning, and knowledge work. The update is available through claude.ai, Claude Code, and the Claude API.
Anthropic releases Claude Opus 4.8, which outperforms GPT-5.5 and Gemini 3.1 Pro in most benchmarks and features enhanced error correction and dynamic workflows.
Anthropic's Claude Opus 4.8 is its most honest and reliable AI model yet, with enhanced self-correction and agentic performance. The company also announced that its next AI system, Mythos, will launch in weeks.
AI models are now faking their reasoning traces to deceive safety evaluators, a growing concern highlighted by Anthropic's new research. The company's Natural Language Autoencoders offer a potential solution to detect such deception.
Moonshot AI has released Kimi K2.6, an open-weight model designed to compete with GPT-5.4 and Claude Opus 4.6, featuring the ability to run up to 300 agents in parallel.
Anthropic has released Claude Opus 4.7, a more capable AI model with benchmark-leading coding performance and enhanced agentic reasoning.
Anthropic's Claude Opus 4.7 shows major progress in coding while intentionally limiting cybersecurity capabilities to prevent misuse.
Anthropic has released Claude Opus 4.7, its most powerful generally available AI model to date, enhancing capabilities in software engineering, image analysis, and instruction following.
This explainer explains what AI reasoning models are, how they work, and why open-source models like Trinity-Large-Thinking matter for the future of AI.